Yu, Bin

<120> Compositions and Methods for the Synthesis and Subsequent
Modification of Uridine-5'-diphosphosulfoquinovose (UDP-SQ)

<130> MSU-04769 

<160> 21 

<170> PatentIn version 3.0
<210> 1 
<211> 1134 
<212> DNA 

<213> Cyanobacterium Synechococcus
<220>

<221> CDS
<222> (1)..(1134)
<400> 1 

atg cgc atc gct ctc ttt acc gag acg t 48
Met Arg Ile Ala Leu Phe Thr Glu Thr
1 5 10

atc gtc acg cgg ctt cgg cac acg gtc gat cac ctg cag cgt ctt ggc 96
Ile Val Thr Arg Leu Arg His Thr Val Asp His Leu Gln Arg Leu Gly
20 25 30

cac acc gtc atg gtt ttt tgc ccc gac ggc ggA ctc cgc gag cac aag 144
His Thr Val Met Val Phe Cys Pro Asp Gly Gly Leu Arg Glu His Lys
35 40 45

ggg gct cga gtc tat ggg gtt aaa ggc ttt ccg < % a ccg ctc tat ccc 192
Gly Ala Arg Val Tyr Gly Val Lys Gly Phe Pro Leu Pro Leu Tyr Pro
50 55 60

gag ctg aag cta gct ttt ccg ttg ccg aaa gtg gga aaa gcc ttg gag 240
Glu Leu Lys Leu Ala Phe Pro Leu Pro Lys Val Gly Lys Ala Leu Glu
65 70 75 80

cgg ttc cgg ccc gac ctg atc cac gtg gtc aat ccg gct gtg ttg ggg 288
Arg Phe Arg Pro Asp Leu Ile His Val Val Asn Pro Ala Val Leu Gly
85 90 95

ttg ggc ggc atc tac tat gcc aag gcg cta aat gtg cca ttc gtg gcg 336
Leu Gly Gly Ile Tyr Tyr Ala Lys Ala Leu Asn Val Pro Leu Val Ala
100 105 110

tcc tat cac acc cat ttg ccg aaa tac ctt gag cat tac ggg ctg ggg 384
Ser Tyr His Thr His Leu Pro Lys Tyr Leu Glu His Tyr Gly Leu Gly
115 120 125

gtc ttg gag ggg gtg ctc tgg gaa ttg ctg aag ctg gcg cat aac caa 432
Val Leu Glu Gly Val Leu Trp Glu Leu Leu Lys Leu Ala His Asn Gln
130 135 140

gca gcg atc aac ctc tgt act tca acc gcg atg gtg cag gag ctg aca 480
Ala Ala Ile Asn Leu Cys Thr Ser Thr Ala Met Val Gln Glu Leu Thr
145 150 155 160

gat cac ggc att gag cac tgt tgc ctc tgg cag cga gga gtg gat acc 528
Asp His Gly Ile Glu His Cys Cys Leu Trp Gln Arg Gly Val Asp Thr
165 170 175

gag acc ttt cgg cca gac ttg gct act gct gcg atg cgc gat cgc ctc 576
Glu Thr Phe Arg Pro Asp Leu Ala Thr Ala Ala Met Arg Asp Arg Leu
180 185 190

a 9 fc 99 c 99 fc aa 9 ccc act 9 C 9 ccc tt: 9 tt: 9 ctc tac 9 tc 99 a C 9 C ctc 624 

Ser Gly Gly Lys Pro Thr Ala Pro Leu Leu Leu Tyr Val Gly Arg Leu 
195 200 205 

tca gcc gag aag caa atc gat cgc ctg cga ccc att ttg gat gcc aat 672
Ser Ala Glu Lys Gln Ile Asp Arg Leu Arg Pro Ile Leu Asp Ala Asn
210 215 220

cct gag gct tgc ttg gcc ttg gtc ggc gat ggc ccg cat cgg gcc gaa 720
Pro Glu Ala Cys Leu Ala Leu Val Gly Asp Gly Pro His Arg Ala Glu
225 230 235 240

cta gag caa ttg ttt gct ggc acc cag acg cag ttc att ggc tat ctg 768
Leu Glu Gln Leu Phe Ala Gly Thr Gln Thr Gln Phe Ile Gly Tyr Leu
245 250 255

cat ggg gaa cag cta ggg gcg gcc tac gct tct gct gac gct ttt gtc 816
His Gly Glu Gln Leu Gly Ala Ala Tyr Ala Ser Ala Asp Ala Phe Val
260 265 270

ttt ccc tcc cgg acc gaa acc ctc ggt cta gtc ttg ctg gaa gcc atg 864
Phe Pro Ser Arg Thr Glu Thr Leu Gly Leu Val Leu Leu Glu Ala Met
275 280 285

gca gcg ggt tgt ccg gtc gtg gcg gcc aat tcc ggt ggc att ccc gat 912
Ala Ala Gly Cys Pro Val Val Ala Ala Asn Ser Gly Gly Ile Pro Asp
290 295 300

att gtc agc gac ggc att aat ggt ttc ctg ttc gat cct gag gat gaa 960
Ile Val Ser Asp Gly Ile Asn Gly Phe Leu Phe Asp Pro Glu Asp Glu
305 310 315 320

caa ggg gcg atc gct gcg att cag cgc ttg ttg gct aac cct gca gag 1008
Gln Gly Ala Ile Ala Ala Ile Gln Arg Leu Leu Ala Asn Pro Ala Glu
325 330 335

cgc gag att cta cgc caa gcg gct cgt caa gaa gcc gaa cgc tgg agc 1056
Arg Glu Ile Leu Arg Gln Ala Ala Arg Gln Glu Ala Glu Arg Trp Ser
340 345 350

tgg aac gca gcc acg cgc caa ctc cag gac tac tac tgc gag gtg ttg 1104
Trp Asn Ala Ala Thr Arg Gln Leu Gln Asp Tyr Tyr Cys Glu Val Leu
355 360 365

gca gat ggt tgc tta ccc tta gcg gcc tga 1134
Ala Asp Gly Cys Leu Pro Leu Ala Ala
370 375



<210> 2 
<211> 377 
<212> PRT 

<213> Cyanobacterium Synechococcus 
<400> 2 

Met Arg Ile Ala Leu Phe Thr Glu Thr Phe Leu Pro Lys Val Asp Gly
1 5 10 15

Ile Val Thr Arg Leu Arg His Thr Val Asp His Leu Gln Arg Leu Gly
20 25 30

His Thr Val Met Val Phe Cys Pro Asp Gly Gly Leu Arg Glu His Lys
35 40 45

Gly Ala Arg Val Tyr Gly Val Lys Gly Phe Pro Leu Pro Leu Tyr Pro
50 55 60

Glu Leu Lys Leu Ala Phe Pro Leu Pro Lys Val Gly Lys Ala Leu Glu
65 70 75 80

Arg Phe Arg Pro Asp Leu Ile His Val Val Asn Pro Ala Val Leu Gly
85 90 95

Leu Gly Gly Ile Tyr Tyr Ala Lys Ala Leu Asn Val Pro Leu Val Ala
100 105 110

Ser Tyr His Thr His Leu Pro Lys Tyr Leu Glu His Tyr Gly Leu Gly
115 120 125

Val Leu Glu Gly Val Leu Trp Glu Leu Leu Lys Leu Ala His Asn Gln
130 135 140

Ala Ala Ile Asn Leu Cys Thr Ser Thr Ala Met Val Gln Glu Leu Thr
145 150 155 160

Asp His Gly Ile Glu His Cys Cys Leu Trp Gln Arg Gly Val Asp Thr
165 170 175

Glu Thr Phe Arg Pro Asp Leu Ala Thr Ala Ala Met Arg Asp Arg Leu
180 185 190

Ser Gly Gly Lys Pro Thr Ala Pro Leu Leu Leu Tyr Val Gly Arg Leu
195 200 205

Ser Ala Glu Lys Gln Ile Asp Arg Leu Arg Pro Ile Leu Asp Ala Asn
210 215 220

Pro Glu Ala Cys Leu Ala Leu Val Gly Asp Gly Pro His Arg Ala Glu
225 230 235 240

Leu Glu Gln Leu Phe Ala Gly Thr Gln Thr Gln Phe Ile Gly Tyr Leu
245 250 255

His Gly Glu Gln Leu Gly Ala Ala Tyr Ala Ser Ala Asp Ala Phe Val
260 265 270

Phe Pro Ser Arg Thr Glu Thr Leu Gly Leu Val Leu Leu Glu Ala Met
275 280 285

Ala Ala Gly Cys Pro Val Val Ala Ala Asn Ser Gly Gly Ile Pro Asp
290 295 300

Ile Val Ser Asp Gly Ile Asn Gly Phe Leu Phe Asp Pro Glu Asp Glu
305 310 315 320

Gln Gly Ala Ile Ala Ala Ile Gln Arg Leu Leu Ala Asn Pro Ala Glu
325 330 335

Arg Glu Ile Leu Arg Gln Ala Ala Arg Gln Glu Ala Glu Arg Trp Ser
340 345 350

Trp Asn Ala Ala Thr Arg Gln Leu Gln Asp Tyr Tyr Cys Glu Val Leu
355 360 365

Ala Asp Gly Cys Leu Pro Leu Ala Ala
370 375



<210> 3 

<211> 2979 

<212> DNA 

<213> Arabidopsis thaliana 
<400> 3
ctacacgtta ccttccggta ctggaaacag tcgtttaatc aaccaattga ttggtcccaa 60
aacatgaact ttctttttcc tccagaacca aatcgctgca ctgtactgtt cattgcgtat 120
ctttgtcgtt gctgctctcc aatcatattt ctcggtctct tctcttgccg cttttccaat 180
gatctctctt gtttcacggt cgtgcagtaa agttctcagt tttgtcacgc aatcttcaac 240
atctccaggg ttgaacaaaa atccggtttt tccctcctga tgaaaacatc agaatcagaa 300
aaccacaagc tcaatatagg ttgacccata agaacaatca atgcaagatc attttgtgta 360
ccagtctatg attgaataaa gtttcagttc ggttacagct cgcttataag aaaattggca 420
gaaattgttt tttcaaccat ttcggttcgg ttgatatgct catcaatatg gtttggcagt 480
taattgtaat tcagataatt cactgacctg atcttcaggg attatatcag ggattccacc 540
ggcacgggcc gcgacgacag gaagtcctga agacattgct tcaagaacca caaggccaag 600
tgtctccgac tctgatggca tcacaaacac atctccactt gcgtaagctt gtgagagttc 660
atcgccttgt aacgttccag tgaaaaccgc tggcattccg gtaaacaact tctcaagatc 720
ctctctttaa gaaaacgaaa cagataaaca aaattacaat gttgttgact agaaatcttc 780
agataacaat atggccaatc tttaacaaaa ctagtacttg tatggtccat ctccaatgaa 840
agcaatccga gcttcaggta atttgtccat tacactgcac acaaatttct caatatcaaa 900
attcgataca ccacttaaaa gaagtgagtc cagtttatac aaaattctaa cctctttaaa 960
agctccaaac tcttttctac gccaatgcga cctacatgaa tcactagtgg cttttctggt 1020
tcgccattac tgttaattac aaaatattaa acatcaagat tagcgtggaa agtatcattg 1080
tttttaatgc atataaaaga aacgtatatt ctattcttgc ctcagtctta tacgcatttc 1140
ttgagaacgg aaacggggat tgaagctttc tgaatcgaca cccttattcc aaagtcgaag 1200
ttgattagct tcaatatatg aaaaaaagaa gaagaaaatg taagttttga acaatcatag 1260
agcttgtaca aaaatgtaat gtatgatctt tcttcacctg cagttgcacc agctgctata 1320
agatcttttc caatggcagc agaaggaact aatgtaagat cagccgctct gtgaaggaac 1380
cctgataaaa gcatattcaa gtttagtttc atattataca tcacaataaa ccagaaaaag 1440
aagaaggaaa ttttgacatt tgaaaagcgg gttttacata cttattatag accacattgg 1500
ttttaccaac caactaaaag tgtatcttgg gatgtatctg cacatcaaca caagcttaat 1560
cttagacaaa aattttttta taacaacatt gtgaaatgag gcagaaaaag gtacttacac 1620
agggacgtgt gtgtggtaag acattactat tggtacagat agcatttttg ctattgccag 1680
agcaccaaag acctaaaaat tttagtcagg gaaaaaagag agtcaagttc tggattctct 1740
ccagttcact ggtcttgcta atgttttagt aatgtgaatt cttgagggat ttaccataac 1800
tccgggagat gaagcgtgta taatgtcagg cttaaaccgt gcaatttcag agatgattct 1860



tggactaagc gcaagcgaga gtggaacctt ttggtaataa ggacaaggga agctacagaa 1920
gagaagaaga aattagcgat attaccaaat agagaacatc cagtgagtaa actaaatggt 1980
gctaccttct tgatccaatg actctggctc cataaaactc ttcaggaaca ccttcatgtg 2040
tcgtcacgac tacaacctat gaagcaaaaa agttattaaa aaaaaaaaaa aaggaacayc 2100
taacacttgt caagtaattc taattctgga aacagttact tatgagctga Ctgaaoayat 2160
acttaagttg aagaatgaga tagtaaaaga agaaacctcg [illegible] cacggaggua 2220
tctaatgaaa ttctggaatc tgtttttgca gccggataca tagctgcaac aaaatcaaag 2280
agagaatcac ttccaataat aacatgacat ataataaaag cttttggtca atggatcggt 2340
gattccgaga atcttgggat attcacaact aaaatctgac aactttgact caaacaaatc 2400
ctgaatgtaa ttggttttaa cgatctacta tataatttgc taaattggtg gtgtagcaaa 2460
ttcatacatt agcgagtatc tcttcataaa ataaatgtaa cgatcaaatc gaaagaaaaa 2520
aacattacag gaaaaagtca accaaggaaa aaaatgagta gaatctgttt ucacagagac 2580
atttcgtcga acacaaaaca agcaaaaaaa gaacactgtg aagaagactt acgcaaaggg 2640
agaaggctca acaaagagag caattctcct aggcttagag agcgactcag gatcgagaag 2700
cggcgcatca atctccgatt cgtcatcttc tctgacttga gtaatagtca tatcgttgga 2760
cccagaaaca gcttcctttg taatcactcc acagaagcgg agcttgcttt tcttgctaat 2820
gggtaatctc cggtgaccaa acgaaagagg ggaatgaaga acaaaagaag aagacctggg 2880
aggagaacaa gaggttgcag aggaagaaga acaagtgtta gtcgtgctag gaagcaaatg 2940
aggaggtata gagagattta tagaagaaag agtcgtcat 2979

<210> 4
<211> 2537
<212> DNA
<213> Arabidopsis thaliana
<400> 4



tcatattttg aaaaagcact tcttcttctt cttcttcttc ttcttcttct tcttcttccc 60
atcaaaaagt gctagtggca agaccgcgca tattccagta acaaagttta aattcaaaac 120
tatggaacac aatttgtccc atgttttgtt gcctgaatta aacgaccata tcacttggtt 180
cggccacttc atctcggata cgcacaagcg gttgtcaagg ttgtaaaaga cgattttgta 240
aggatctaca ttggcaaagg gagctttaca gacgacttga aaagtttcgg tgtgaagatt 300
gaaaggtaaa actttggttt ctttgcactc ggtgaaccaa taaagcgacc catctagata 360
cacaggtgcg gggtaagcag caatccgata aggagcagcg agagtgacat acctccaagc 420



attggtacta 


aagtcgaaaa 


ettegcatgt 


ageagegcuu 


4- 4— ^ /— i +~ ^3 

ttCaggccta 


4- 4- 4- 4- /->r /—i -a a 
LLLCUyCaya 


f± o u 


gttgtataac 


caaaeggget 


tgtatgtgcc 


cctgaatttg 


4- n(- 4> f> n /«i /-i/~r ^ 

LCLLLdCCya 


aLCCaayCal 


Oft u 


aaagaatttg 


cgt ttaagee 


tgtagtaacc 


atetegtaag 


tcgat cauga 


/-rf- 4- 4- 4- 4- 4- — 

yut.t.t.t.ycit.a. 


DUU 


atcgcaaaga 


ggaagaggee 


gataccatct 


agtggtcggg 


ccgaccacat. 


a a f~if~i at* ht - 
aaCCyydl. 1. 1 


oou 


gtcgtgattg 


taaagacaaa 


egagacegt c 


acaac uac La 


t.y tyaaac t.o. 


ay t cn_cty Lett 




i_ 4_.— 4-__4_4— — . 

gttatccttc 


t c c c aaggag 


t agggatct t 


gaaegt. tg at 


gauyaaccca. 


clCt.yC.cia.L.yL. 


/ o u 


tcttaaagat 


tctatggtcg 


ggt t tataac 


atggtgagga 


aacacayaca 


CCaLCdyaaL 




_ 4- _ 4_ .— 4- — - 4- 

atctggacct: 


ectgattget 


gacgatgttt 


caacuaaaat 


tyCaLaCatC. 


LttdUULLdL 


u u 


taaaaagata 


— i —> 4- — ,4-4-4- 

aatatcattt 


j_ i_ i_ — < — i 4- 4- 

tgttggaatt 


ggaaaacaaa 


acccaaaaga 


c aa c 1. 1. 1. aga 


you 


tatgtaaggg 


ucgcauatgt 


at ttgattac 


attgetatga 


gcat.at.gac c 


z^i -—i /^i 4— 4~ 4" /*^* "^i 


i n 9 n 


ataatgaaaa 


aaataaagat 


gu ugagucc t 


accaatxaag 


/~r/~T ^ ^ ^ ^ +* a O 

yyCaaaLaCa 


dddttLttyd 


1UOU 


teat taatac 


aaagaaaaag 


cct ctcat aa 


gccaaagcca 


4— ^ 4- 4~ <^ 4— 


cy LaaaLciLL 




tccgcagcat 


actctttaca 


cttccttcct 


ctctccgcca 


— s 4— 4— m 4— « /^i 

aLCtccccgc 


ccclll ciy t. L- 


i 9 n n 

X £* \J V 


acagcaacct 


ccataaccgc 


egtcaaaget 


tccacat t tg 


gegcaaacat 


daaCCCaddC 




tcatcattca 


ccactatagt 


n 4- n 4— 4-4- — i 4- *-» 

cctctt tatg 


cttgcgtacc 


tagatgecat 


cactggttta 


i ion 


ccactcaaca 


tcgcttccat 


taacgttaaa 


tcaagacctt 


gtggtctaag 


cgccggau uc 


J.JOU 


acgaacaaat 


cgatcccgtt 


gtagaaaccc 


tt gage teat 


c tgggc ccag 


ay at.ee caaa 


i a a n 


atggaaactt 


tttcccctaa 


txece ugcaa 


cgtxgc t ccc 


4- /I /-( /-t 4— / -y — 

dtyyccctya 


LCCdyCLdCt 


IJUU 


acgaggtaaa 


catttgaata 


cgt ttggatt 


attttcgega 


aagctt cgaa 


gagcaauyya 


IjDU 


i— 4— — — 4— 4- 4— _ _ j_ 

tgtcctttgt 


ctttgactaa 


tctcccagca 


get cc taaaa 


caauege tya 






ggtaacccta 


n 4- 4— 4- 4— -~ m « 4- 

attttgacct 


aaacagagta 


cgt age 1 1 c t 


tgtctgatgt 


gaat ccgt t c 


1 ran 


tcgtcgactc 


cattgaggat 


cacatgaacc 


cttttctcag 


ggat t tggt a 


aaegtetc t t 


i *7 a n 

J. / ft u 


~ « n 4_ 4— 4. « 4_ M 
agcatttctc 


cgcagctatc 


getgatageg 


atgtggtgag 


cgtagttgtg 


gaagaat ctg 


i q ri n 


— i 4- 4- 4- n j__4- Ann 

atttegtcca 


gtatcttggg 


aagcaeggea 


cegtatagae 


ttgcattgaa 


gcccugcgau 


lODU 


ctcggtccgc 


ctggtttacg 


gatcaggtct 


tggtaaatac 


ccgat ugt aa 


yCtCuCLddC 


i Q9n 


geaatgeegt 


gecaggatae 


agegaggt 1 1 


ggaacctccc 


gggegatcca 


gugaggcaaa 




gcaacacttt 


cagagtgaac 


cgcatcgaaa 


— 4-4-m4-4-4-/-i4- 

ggt tctctct 


4—^4-4-4-4-^4-4-^ 

CyLULtCutC 


tuyy Ldaayc 


9 n a n 

ZU1U 


tcccatgcct tgttgtaccg ccattttccg ggctccgcgt cgccgtgaga atggattata 2100
gggtaaatga tttggtcgga aacagggggg attttgttgg tttcagggga ttggtctaag 2160
ggagaggtga aaacgtggac acggtgtcca cggcgagcta aggcggtgta tagagtgaag 2220
gcgtggcgtt ccatgccgcc aggattggga cccgtgggcc atttccggga gaaaacagcg 2280
agttttagtg


i_ i_ i_ 4_ ______ — — _ — _ 

ttttgggagg 


agggttggtt 


aaggagaaat 


caaggcggtx 


C C dy y C a. a. a. U 


01 ACS 
Z j f± U 




tgggcggttt ggaggtcgcc agaccacggt ggctggtttg tgtcagagga ggaggaggag 2400
actgcagcgg tggaggagga gcaagtggag gtgcggagga gaaagagagc cggtatggta 2460
aagaggacgg tgaagaagag gaaagtacaa aggctaaaat gagaattagg tttcttgagt 2520
ttggtttgtg aagccat 2537




<210> 5 
















<211> 2369 














<212> DNA 
















<213> Arabidopsis thaliana












<400> 5 
ttaaggtctc 


atgcatttga 


ccagaacatc 


gacyaaccyt 


L. U y LaLaLaL. 


y L LyL L L.L.CIL 


60 




gtacattttc 


tcaaccattt 


tgcgtccttc 


^ /~1 4" /—I /~i « j^i •— \ 4— 

aCCCCCCaaC 




/~</~T 4- 4- frfr a | 4- 4- 

Ly t_v_. uyy du i_ 


12 0 




cctaagtaga 


tacaagagat 


tatgagctaa 


4_4__4_.1_4-.-_.4-J- — 

tccutcgtLa 


****** 4— /— < 4— /"I 

ccggaccccc 


CCaUtydy ty 


i ad 

lOU 


yS 


aagtagacca 


gtcatgttgt 


gttgaaccat 


i_ i_ 4_ 4- ._-.__i_ i_ 

cccucuggut 


cctcctycac 


/~* 4— 4— /— > /— » /— « a /-i 

CtyLLCCCaC 


7 AO 


CSO, 

w 


cactgcaagt 


ccataagcca 


ttgcttcgat 


tgtcactcta 


ccaaaugu u t 


CdCCaaCt.CC 


7 n n 


feO 

0 


ctgtaaaaaa 


tcaaggttca 


aaataaataa 


accgaacaaa 


a ccgaaa egg 


LCaadCCyad 


n 

J D U 


f M 


atcaaaacct 


acattttgaa 


caaaaacaaa 


ac tgaaatgt 




aCCdddutyd 


4 !? f) 
1 z u 


0 


aatcaatcac 


aaattggttt 


atttgattca 


gtaat. ccaay 


+— +- /—i /—i 4- --j +- n n n 


dL l_y d L-dddd 


480 




ctaaaccaaa 


acctaactgt 


aaactacttt 


ctagtggaaa 


actgcatatg 


m 4~ /^-i "*\ ^% 4~ -™i 

caucaaacaa 


C/iri 


O 


tgtttttgag 


gtgtgaggag 


gattacctgg 


gagtttgtaa 


cgtagacatc 


izgcugcagag 


(Tflfl 
wUU 




tataatgaag 


caacacgggt 


tgttgcagga 


gtccacatta 


ccgacttaga 


+— * y"«4- 4— 4>- /-» 

caagc ccccg 




ctgtttgaca 


agaagcttaa 


catctcttta 


acgtat ccaa 


— x- a- 4- — r f. 4- — 4- 

CLLtgCuyCt 


CCULyadCCC 


/ z u 




acggatccta 


aaagaacttt 


aagttcttgc 


4-4-,— 4- / ~4-,'-i4-4-/-i 


4— 4— -^i /lat-t- 
L. LdayCCat L- 


y uoctcty uy uy 


780 




agagaaacac 


ttttcatctg 


tcgtgatgaa 


cc t ct t aaac 


^ 4- /— r 4— 4* 4™ f% 4/— 


ggd.gdyd.LCa 






accttttctt 


tcctaatgat 


ccccttgtga 


4- 4- 4- 4- 4- — > y_» 

ttcctttgag 


attcttgtcc 


4— /^i 4~~ /^i 4— r~i /~v *^ 

ccccccagaa 


on p. 




agagccaagg 


caatagattc 


aaggagaaga 


agtxgtccct; 


4- 4- j— •+- 4- nnnf 4- 

tugtxgggtt; 


caugcugcca 


q c fi 

Sow 




agagacatca caagcatatc tgaatctgtt attcctaact ctgttctcac tgattcgcgt 1020
aatatttgtc tcttcaccct cattttctct ggtgaaagtg ttggtgtgtt gagtgaagaa 1080
ggaatccccg ctacaaaagc taactcatca ttaacagata gtggaacaat cactggttgt 1140
gatctaagct ttatatgctc ctcctcgcac catgttaacc attgtctgct ctgtgattca 1200
gataagaaaa tcagcatttt cactcggtca agaactggtt tcgctcgatc aaagtattct 1260
cgtcgattct


ccattatcca 


ccaagctatt 


ugacuuccac 


cage uggaug 


a f - <t a t/ 1" pa h r^f 

dLydttcdty 


1 jZU 




tattgatctg 


cgagaaaagg 


aaaaaaaaaa 


ay LaaCLddy 


4— --3 4- /-* t~* 4- 4— 4- 

dCatyCL UUL 


LL-LLydLLyL 






aaattagtag 


tctaaaaact 


gcataatgaa 




tccaUyayy l 


ClL.GL^ClL> Uy V L 


1 44 n 




gatcctgcga 


tgatcaaatc 


+" /"I 4— 4— /~1 *™4 ^ 

tigcc u ccacg 


yCaytCttya 


a /™t /"*• fra fr 4- 4- 

ayctyayt ll* 




1500 




tcaacaactt 


tgacccttct: 


cctactgagc 


cccL.gcat.ca 


^ +- j-i ^ /-^ /-i 4- /-I 4— 

auccaccL.cc 


/— i r-i 4- /~i 4- 3 /"ja 

CCL.yL-t.cia.yci 


1 JOU 




actactgcag 


agactgttgc 


accacagctc 


aaaagctctg 


aagecage cc 


a +~ ^» a +■ a rra a 
C ci t. C d L. ciy d d 






attggagcac 


cagtcattga 


tagctcgtgg 


aaaagcagga 


cgaat c t cc t 


4— /-w *~i t~-% « a a a ^ a 

ugaccaaaca 


1DOU 




aggcgtttaa 


aatctgattt 


ccugucgcaa 


ytCCCayaLC 


4— 4- ^-i 4- — \ 4- « « /-ffr 

LtCLatycyy 


yL-t-CLrClL.t.L-d 


T 740 




agaactttat 


cctctagtga 


cccaaaggga 


CCadyddyCL 


LdCLaLaayL 


dy L.dLLL.yLL. 


1 fifif) 

lOUU 




aatggaagtt 


gtggat ct tg 


CL.caucat.cc 


4— /~1 f-* 4- 4— /— *r 

aaat-CCLLdy 


4- /—i 4- /—i —j —j j— r +- -J « 

L. L- L- k^ctcty LcLL. 


dd L L. 


1860 




accttctgct 


taacaeggat 


cttactacga 


gaagtccgga 


CayCLLLLCL 


fi/~1 4" 4— 4— /^i 4/~ *~t/~i 

Ly tCLLLLyL 


j. y \j 




ttggaactca 


agctccgtcg 


agaaacccca 


tCaUCLLLCC. 


4— /if *-\ 4— /-i ^ q f-\ 4— 

tyaCCayaCt 


a a a t~ (~* rrt" t~* 
aaCaLLLy LL 


1 QRD 

X -7 O U 




ctcttattcg 


aaccagcatc 


atctttacca 




ccaaagcc uc 


^ /™T 4— 4— y^-l 4— /*! 2 

dyadLLCLCa 


9 ri4 n 


t.8 5 
SWT 


ttagcaacaa 


catccagtcc 


u tuaatcttc 


tccatatata 


act cat cct t 


LCtcyycCLy 






cctccaaacc 


gtaaaaactc 


aactttgett 


tcattatcat 


gtgcccacct 


agactgaaca 


2160 


3 

0 


t aaaacccaa 


gacacgtcca 


--\ /-^ /— f 4— *~i 4— 

aaycyLaatc 


a a a anr'anpr' 
aaaayCayCL 


ctd L dctdL^Cctct 


yy LdL Ld 


2220 


cgaaaccact gaaacgctcc tcctccacca tgacctctac gcggagtcct accagaatac 2280
actctaggtg taccccttgg agtagacctc cctgacagtg aagacttaac acnugucugt 2340
ctcagcggcg ataaccgaat ctcctccat 2369

<210> 6
<211> 1799












<212> DNA 
















<213> Arabidopsis thaliana 












<220> 
















<221> CDS 
















<222> (170) . . (1603) 














<400> 6 
gtcgacccac gcgtccgctc atctctcatc gttccgggag aagagaagag agacccatcc 60

ctcacttcaa agttcaaagt ctcgaaggat cttctccaac tctctctaaa caagattcca 120

aattttcaaa ggtgaatttg tttgatagaa tcaagaacaa acctttaaa atg gcg cat 178
Met Ala His
1

cta ctt tca gct tca tgc cct tca gtt atc tca ctt agc agc agc agc 226
Leu Leu Ser Ala Ser Cys Pro Ser Val Ile Ser Leu Ser Ser Ser Ser
5 10 15

agc aag aat tca gtt aag ccg ttt gtt tca ggg cag acc ttc ttc aat 274
Ser Lys Asn Ser Val Lys Pro Phe Val Ser Gly Gln Thr Phe Phe Asn
20 25 30 35

gct cag ctt ctt tca aga tct tct ctc aaa gga ctt ctc ttc caa gag 322
Ala Gln Leu Leu Ser Arg Ser Ser Leu Lys Gly Leu Leu Phe Gln Glu
40 45 50

aag aaa ccg aga aaa agc tgc gtt ttc aga gca act gct gta cct ata 370
Lys Lys Pro Arg Lys Ser Cys Val Phe Arg Ala Thr Ala Val Pro Ile
55 60 65

acc caa caa gca cca ccc gaa aca tct acc aat aac tca tcc tct aaa 418
Thr Gln Gln Ala Pro Pro Glu Thr Ser Thr Asn Asn Ser Ser Ser Lys
70 75 80

cca aag cgt gtt atg gtc att ggt gga gat ggt tat tgc ggt tgg gct 466
Pro Lys Arg Val Met Val Ile Gly Gly Asp Gly Tyr Cys Gly Trp Ala
85 90 95

act gct ctc cac ttg tcc aag aag aat tac gaa gtt tgc att gtt gac 514
Thr Ala Leu His Leu Ser Lys Lys Asn Tyr Glu Val Cys Ile Val Asp
100 105 110 115

aac ctt gta aga cgt ctt ttc gac cac cag ctt gga ctt gag tca ttg 562
Asn Leu Val Arg Arg Leu Phe Asp His Gln Leu Gly Leu Glu Ser Leu
120 125 130

act cct att gcc tcc att cat gac cga atc agc cga tgg aag gct ttg 610
Thr Pro Ile Ala Ser Ile His Asp Arg Ile Ser Arg Trp Lys Ala Leu
135 140 145

aca ggg aaa tca att gag ttg tac gtt ggt gat atc tgt gat ttc gaa 658
Thr Gly Lys Ser Ile Glu Leu Tyr Val Gly Asp Ile Cys Asp Phe Glu
150 155 160

ttc tta gct gag tct ttc aag tct ttt gag ccg gat tca gtt gtc cac 706
Phe Leu Ala Glu Ser Phe Lys Ser Phe Glu Pro Asp Ser Val Val His
165 170 175

ttt ggg gaa cag aga tcc gct cct tac tcg atg att gac cgg tcc aga 754
Phe Gly Glu Gln Arg Ser Ala Pro Tyr Ser Met Ile Asp Arg Ser Arg
180 185 190 195

gca gtt tat aca cag cac aac aat gtg att ggg act ctc aac gtt ctc 802
Ala Val Tyr Thr Gln His Asn Asn Val Ile Gly Thr Leu Asn Val Leu
200 205 210

ttt gct ata aaa gag ttt gga gag gag tgt cat ctt gta aaa ctt ggg 850
Phe Ala Ile Lys Glu Phe Gly Glu Glu Cys His Leu Val Lys Leu Gly
215 220 225

acg atg ggt gag tat gga act cca aat att gac atc gag gaa ggt tat 898
Thr Met Gly Glu Tyr Gly Thr Pro Asn Ile Asp Ile Glu Glu Gly Tyr
230 235 240

ata acc ata acc cac aac ggt aga act gac act ttg cca tac ccc aag 946
Ile Thr Ile Thr His Asn Gly Arg Thr Asp Thr Leu Pro Tyr Pro Lys
245 250 255

caa gct agc tcc ttt tat cat ctt agc aaa gtt cat gat tcg cac aac 994
Gln Ala Ser Ser Phe Tyr His Leu Ser Lys Val His Asp Ser His Asn
260 265 270 275

att gct ttt act tgc aag gct tgg ggt att aga gcc act gat ctc aac 1042
Ile Ala Phe Thr Cys Lys Ala Trp Gly Ile Arg Ala Thr Asp Leu Asn
280 285 290

caa gga gtt gtt tat gga gtg aag act gat gag aca gag atg cat gag 1090
Gln Gly Val Val Tyr Gly Val Lys Thr Asp Glu Thr Glu Met His Glu
295 300 305

gaa ctc cgt aac cga ctg gat tac gat gct gtg ttt ggt aca gca ctt 1138
Glu Leu Arg Asn Arg Leu Asp Tyr Asp Ala Val Phe Gly Thr Ala Leu
310 315 320

aac cgg ttc tgt gtg caa gct gct gtt ggt cac cca ctt aca gtt tat 1186
Asn Arg Phe Cys Val Gln Ala Ala Val Gly His Pro Leu Thr Val Tyr
325 330 335

ggt aaa ggt ggt cag acg aga ggc tac ctc gat ata aga gac acg gtt 1234
Gly Lys Gly Gly Gln Thr Arg Gly Tyr Leu Asp Ile Arg Asp Thr Val
340 345 350 355

caa tgt gtt gag atc gct ata gca aac ccg gca aaa gct ggt gag ttc 1282
Gln Cys Val Glu Ile Ala Ile Ala Asn Pro Ala Lys Ala Gly Glu Phe
360 365 370

cgg gtc ttc aac caa ttt aca gaa cag ttt tca gtc aat gaa ctg gct 1330
Arg Val Phe Asn Gln Phe Thr Glu Gln Phe Ser Val Asn Glu Leu Ala
375 380 385

tca ctc gtc act aaa gcg ggt tca aag ctt ggg cta gac gtg aaa aag 1378
Ser Leu Val Thr Lys Ala Gly Ser Lys Leu Gly Leu Asp Val Lys Lys
390 395 400

atg acg gtg cct aac ccg aga gtg gag gca gaa gaa cat tac tac aac 1426
Met Thr Val Pro Asn Pro Arg Val Glu Ala Glu Glu His Tyr Tyr Asn
405 410 415

gca aag cac act aag ctg atg gaa ctt gga ctt gag cct cac tat cta 1474
Ala Lys His Thr Lys Leu Met Glu Leu Gly Leu Glu Pro His Tyr Leu
420 425 430 435

tct gac tca ctt ctt gat tcg ttg ctc aac ttt gct gtt cag ttt aaa 1522
Ser Asp Ser Leu Leu Asp Ser Leu Leu Asn Phe Ala Val Gln Phe Lys
440 445 450

gat cgt gtg gac acg aaa caa atc atg cct agt gtt tcc tgg aag aag 1570
Asp Arg Val Asp Thr Lys Gln Ile Met Pro Ser Val Ser Trp Lys Lys
455 460 465

att ggc gtc aag act aag tcc atg acc aca taa agtgcagacc aatattacac 1623
Ile Gly Val Lys Thr Lys Ser Met Thr Thr
470 475

ataaggagag attatgaaag agatgatgtg ttgtttggta tcttcaaact tcatttctgc 1683

aaaagacttg ctaggcttaa gaggttttgt ccatattaca ttgtgcaggt tctttaatgt 1743

tagatcttaa tttcgatgaa aaaaaaaaaa aaaaaaaaaa aaaaaagggc ggccgc 1799
<210> 7
<211> 477
<212> PRT

<213> Arabidopsis thaliana
<400> 7

Met Ala His Leu Leu Ser Ala Ser Cys Pro Ser Val Ile Ser Leu Ser
1 5 10 15

Ser Ser Ser Ser Lys Asn Ser Val Lys Pro Phe Val Ser Gly Gln Thr
20 25 30

Phe Phe Asn Ala Gln Leu Leu Ser Arg Ser Ser Leu Lys Gly Leu Leu
35 40 45

Phe Gln Glu Lys Lys Pro Arg Lys Ser Cys Val Phe Arg Ala Thr Ala
50 55 60

Val Pro Ile Thr Gln Gln Ala Pro Pro Glu Thr Ser Thr Asn Asn Ser
65 70 75 80

Ser Ser Lys Pro Lys Arg Val Met Val Ile Gly Gly Asp Gly Tyr Cys
85 90 95

Gly Trp Ala Thr Ala Leu His Leu Ser Lys Lys Asn Tyr Glu Val Cys
100 105 110

Ile Val Asp Asn Leu Val Arg Arg Leu Phe Asp His Gln Leu Gly Leu
115 120 125

Glu Ser Leu Thr Pro Ile Ala Ser Ile His Asp Arg Ile Ser Arg Trp
130 135 140

Lys Ala Leu Thr Gly Lys Ser Ile Glu Leu Tyr Val Gly Asp Ile Cys
145 150 155 160

Asp Phe Glu Phe Leu Ala Glu Ser Phe Lys Ser Phe Glu Pro Asp Ser
165 170 175

Val Val His Phe Gly Glu Gln Arg Ser Ala Pro Tyr Ser Met Ile Asp
180 185 190

Arg Ser Arg Ala Val Tyr Thr Gln His Asn Asn Val Ile Gly Thr Leu
195 200 205

Asn Val Leu Phe Ala Ile Lys Glu Phe Gly Glu Glu Cys His Leu Val
210 215 220

Lys Leu Gly Thr Met Gly Glu Tyr Gly Thr Pro Asn Ile Asp Ile Glu
225 230 235 240

Glu Gly Tyr Ile Thr Ile Thr His Asn Gly Arg Thr Asp Thr Leu Pro
245 250 255

Tyr Pro Lys Gln Ala Ser Ser Phe Tyr His Leu Ser Lys Val His Asp
260 265 270

Ser His Asn Ile Ala Phe Thr Cys Lys Ala Trp Gly Ile Arg Ala Thr
275 280 285

Asp Leu Asn Gln Gly Val Val Tyr Gly Val Lys Thr Asp Glu Thr Glu
290 295 300

Met His Glu Glu Leu Arg Asn Arg Leu Asp Tyr Asp Ala Val Phe Gly
305 310 315 320

Thr Ala Leu Asn Arg Phe Cys Val Gln Ala Ala Val Gly His Pro Leu
325 330 335

Thr Val Tyr Gly Lys Gly Gly Gln Thr Arg Gly Tyr Leu Asp Ile Arg
340 345 350

Asp Thr Val Gln Cys Val Glu Ile Ala Ile Ala Asn Pro Ala Lys Ala
355 360 365

Gly Glu Phe Arg Val Phe Asn Gln Phe Thr Glu Gln Phe Ser Val Asn
370 375 380

Glu Leu Ala Ser Leu Val Thr Lys Ala Gly Ser Lys Leu Gly Leu Asp
385 390 395 400

Val Lys Lys Met Thr Val Pro Asn Pro Arg Val Glu Ala Glu Glu His
405 410 415

Tyr Tyr Asn Ala Lys His Thr Lys Leu Met Glu Leu Gly Leu Glu Pro
420 425 430

His Tyr Leu Ser Asp Ser Leu Leu Asp Ser Leu Leu Asn Phe Ala Val
435 440 445

Gln Phe Lys Asp Arg Val Asp Thr Lys Gln Ile Met Pro Ser Val Ser
450 455 460

Trp Lys Lys Ile Gly Val Lys Thr Lys Ser Met Thr Thr
465 470 475



<210> 8 

<211> 1209 

<212> DNA 

<213> Cyanobacterium Synechococcus 



<400> 8 





gtgaagat t c 




uyy cy dt-yy u 


ttLtycyyy l. 


yy^L>L> y v_ y 


t"r , t*r , aat - tt"cr 


60 




gccgctgcag 


ycCaCyCCy u 


/*■» ^ /™i /™i a +- i - fit* +~ 
CaCCattyCL 


y cii_cict u ^-y 






120 




gaattggggg 


uucaguccct. 


/~i ^ s-i 4— /"i/r ■a 4~ 

CaCLCCyaCC 


y cyacyau uy 


aaL-yui-yy i_ l. 


y dciy y t»y y 


180 




caagaaacgg 


gcgggcagcc 


gat cage uuu 


» 4~ 3 2 f- r 1 h ppf 
ytUaatCLwy 


ctLL l ay ^ y y 


L.y cl l« La^yau 


240 


g 


cgcctctgtg 


cactactgct 


agaaacgcag 


ccyyatycya 


L-L-yuy^-ctL, i_ l. 


uyi_^yc*.cn— ■ a y 


300 


S3 
P 


cgcgccgccc 


cctattcaat 


gaagagtgca 


tggcacaagc 


ft f~i ¥~ 4- y-1 a /"t/TiTt" 

yctLCacyy t. 


t_ del ddv— dd\_ 


360 




gtcaacgcca 


cccacaatct 


get ctgcgcc 


Lgtgcggaug 


4— rrirfr 1 piraa 

uoyyt— <— cid 




420 


gtccacttgg gcaccatggg cgtctatgga tacggtagcc atcgcggggc tacgattcct 480
gaaggctact tagaagtgga agtcgtccag cgggatggcc aacgctttga agagaagatt 540
cttcaccccg ttgatccggg tagcgtctat cacatgacca agacgctgga tcaattgttg 600
ttctactact acaacaagaa cgacaacatc caagtcaccg accttcacca gggtattgtc 660
tggggcacga acaccgatca ctgtaatctc cacccggatc tgaccaatcg gttcgactac 720
gacggtgatt acggcacagt cttgaaccgc ttcttgatgc aggcggcgat cggctatccc 780
ttgactgtgc atggcgttgg tggccaaacc cgagccttca tccacattcg cgactcagtg 840
cgctgcgtcc aactggcgat cgaaaatccg ccagcagcca atgaaaaagt ccgcatcttt 900
aaccagatga cggaaaccta ccaagtcaag gatttggcag agaaagtggc ggcattgacc 960
ggtgctgaaa tcgcctacct gcccaatcca cgcaaggaag cccttgagaa cgacttgatt 1020
gtcgacaacc gctgcttgat tgatttaggc ctcaatccga ccaccttgga caatggcctg 1080
atgagcgaag tggtagaaat tgcgcagaag tttgccgatc gctgcgatcg cgccaaaatt 1140
ccctgcgttt ctgcctggac ccgtaatcaa gctgaagctc tcagcgctcc tgaaaccgct 1200
ctgcgctaa 1209

<210> 9
<211> 402
<212> PRT 

<213> Cyanobacterium Synechococcus 
<400> 9 

Met Lys Ile Leu Val Leu Gly Gly Asp Gly Phe Cys Gly Trp Pro Cys
1 5 10 15

Ala Leu Asn Leu Ala Ala Ala Gly His Ala Val Thr Ile Val Asp Asn
20 25 30

Leu Val Arg Arg Lys Thr Asp Val Glu Leu Gly Val Gln Ser Leu Thr
35 40 45

Pro Ile Ala Thr Ile Glu Arg Arg Leu Lys Ala Trp Gln Glu Thr Gly
50 55 60

Gly Gln Pro Ile Ser Phe Val Asn Leu Asp Leu Ala Ala Asp Tyr Asp
65 70 75 80

Arg Leu Cys Ala Leu Leu Leu Glu Thr Gln Pro Asp Ala Ile Val His
85 90 95

Phe Ala Glu Gln Arg Ala Ala Pro Tyr Ser Met Lys Ser Ala Trp His
100 105 110

Lys Arg Phe Thr Val Asn Asn Asn Val Asn Ala Thr His Asn Leu Leu
115 120 125

Cys Ala Cys Val Asp Val Gly Leu Lys Ser His Ile Val His Leu Gly
130 135 140

Thr Met Gly Val Tyr Gly Tyr Gly Ser His Arg Gly Ala Thr Ile Pro
145 150 155 160

Glu Gly Tyr Leu Glu Val Glu Val Val Gln Arg Asp Gly Gln Arg Phe
165 170 175

Glu Glu Lys Ile Leu His Pro Val Asp Pro Gly Ser Val Tyr His Met
180 185 190

Thr Lys Thr Leu Asp Gln Leu Leu Phe Tyr Tyr Tyr Asn Lys Asn Asp
195 200 205

Asn Ile Gln Val Thr Asp Leu His Gln Gly Ile Val Trp Gly Thr Asn
210 215 220

Thr Asp His Cys Asn Leu His Pro Asp Leu Thr Asn Arg Phe Asp Tyr
225 230 235 240

Asp Gly Asp Tyr Gly Thr Val Leu Asn Arg Phe Leu Met Gln Ala Ala
245 250 255

Ile Gly Tyr Pro Leu Thr Val His Gly Val Gly Gly Gln Thr Arg Ala
260 265 270

Phe Ile His Ile Arg Asp Ser Val Arg Cys Val Gln Leu Ala Ile Glu
275 280 285

Asn Pro Pro Ala Ala Asn Glu Lys Val Arg Ile Phe Asn Gln Met Thr
290 295 300

Glu Thr Tyr Gln Val Lys Asp Leu Ala Glu Lys Val Ala Ala Leu Thr
305 310 315 320

Gly Ala Glu Ile Ala Tyr Leu Pro Asn Pro Arg Lys Glu Ala Leu Glu
325 330 335

Asn Asp Leu Ile Val Asp Asn Arg Cys Leu Ile Asp Leu Gly Leu Asn
340 345 350

Pro Thr Thr Leu Asp Asn Gly Leu Met Ser Glu Val Val Glu Ile Ala
355 360 365

Gln Lys Phe Ala Asp Arg Cys Asp Arg Ala Lys Ile Pro Cys Val Ser
370 375 380

Ala Trp Thr Arg Asn Gln Ala Glu Ala Leu Ser Ala Pro Glu Thr Ala
385 390 395 400

Leu Arg



<210> 10
<211> 26
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 10

aaaggatccc gtgttatggt cattgg

<210> 11
<211> 26
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 11

gtcggatcct tatgtggtca tggact

<210> 12
<211> 24
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 12

tttggatccc gcatcgctct cttt

<210> 13
<211> 23
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 13

ataagcttcg agctcaggcc gct

<210> 14
<211> 29
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 14

cgggatccat gacgactctt tcttctata

<210> 15
<211> 28
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 15

aaggatccct acacgttacc ttccggta

<210> 16
<211> 28
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 16

aaggatccat ggcttcacaa accaaact

<210> 17
<211> 28
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 17

gcggatcctc atattttgaa aaagcact

<210> 18
<211> 27
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 18

agggtaccat ggagggattc ggttatc

<210> 19
<211> 28
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 19

gcggtacctt aaggtctatg catttgac 28

<210> 20
<211> 26
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 20

ctaggtacca aatggcgcat ctactt 26

<210> 21
<211> 26
<212> DNA
<213> Artificial/Unknown
<220>
<221> misc_feature
<223> Synthetic
<400> 21

aaggatcctg cgctaaagtc gcactc 26